Exact Learning of Tree Patterns
نویسندگان
چکیده
Tree patterns are natural candidates for representing rules and hypotheses in many tasks such as information extraction and symbolic mathematics. A tree pattern is a tree with labeled nodes where some of the leaves may be labeled with variables, whereas a tree instance has no variables. A tree pattern matches an instance if there is a consistent substitution for the variables that allows a mapping of subtrees to matching subtrees of the instance. A nite union of tree patterns is called a forest. In this thesis, we study the learnability of tree patterns from queries when the subtrees are ordered or unordered. The learnability is determined by the semantics of matching as de ned by the types of mappings from the pattern subtrees to the instance subtrees. Exact supervised learning is used. We rst show that ordered tree patterns and forests, with an in nite label alphabet (or equivalent condition), are learnable from equivalence (and membership) queries. Ordered forests and similar classes are shown to be as hard to learn as DNF without an in nite label alphabet or equivalent. We next show that unordered tree patterns and forests are not exactly learnable from equivalence and subset queries when the mapping between subtrees is one-to-one onto, regardless of the computational power of the learner. Tree and forest patterns are learnable from equivalence and membership queries for the one-to-one into mapping. Finally, we connect the problem of learning tree patterns to inductive logic programming by describing a class of tree patterns called Clausal trees that includes nonrecursive ingle-predicate Horn clauses and show that this class is learnable from equivalence and membership queries. Integrate divide / \ ---> / \ ^ x / \ / \ / + x n ^ / \ / \ n 1 x + / \ n 1 Figure 0.1: Simple Integration Rule Integrate Integrate Integrate / \ / \ / \ ^ x ^ x ^ x / \ / \ / \ x 3 x 5 x n (a) (b) (c) Figure 0.2: Simple Learning Illustration (Integration/Ordered Trees) Acknowledgments This research was partially supported by the NSF under grant number IRI-9520243. We thank Dana Angluin, Lisa Hellerstein, Roni Khardon, Stephen Kwek, David Page, Vijay Raghavan, and Chandra Reddy for interesting discussions on the topic of this paper. We thank the reviewers for many excellent suggestions. 1 A A A / \ / \ / \ B C B C z y / \ / \ | D E F G H (a) (b) Examples Pattern Figure 0.3: Learning with Abstract Trees 2 Chapter
منابع مشابه
Which is effective: self-directed learning or tutor-directed learning on the level of nursing skills
Introduction. This is quasi experimental research in order to determine and compare the learning level of nursing skills ( in B.A students) with self-directed learning and tutor-directed learning pattern in Shaheed Beheshti Univeristy of Medical Sciences and Health Services, Nursing and Midwifery faculty, 1998-1999. Methods. First of all, a questionnaire composed of some demographic data such ...
متن کاملUsing an Adaptive Search Tree to Predict User Location
In this paper, we propose a method for predicting a user’s location based on their past movement patterns. There is no restriction on the length of past movement patterns when using this method to predict the current location. For this purpose, a modified search tree has been devised. The search tree is constructed in an effective manner while it additionally learns the movement patterns of a u...
متن کاملنقش درخت زندگی در فرش های ترکمنی (با تاکید بر نقوش درخت در فرهنگ اسلامی و تمدن های باستانی)
In Islam the “Tree of Life” is named as Sedreh or Tuba and the followers of Islam believe that this tree is grown in Heaven; therefore it is an interesting subject for artistic innovative. In Turkmen terminology “tree of life” is called “Yashaish bagh”. In this study we have made an effort to evaluate the symbol of the Tree and the “Tree of Life”,...
متن کاملComplexity of Equivalence and Learning for Multiplicity Tree Automata
We consider the complexity of equivalence and learning for multiplicity tree automata, i.e., weighted tree automata over a field. We first show that the equivalence problem is logspace equivalent to polynomial identity testing, the complexity of which is a longstanding open problem. Secondly, we derive lower bounds on the number of queries needed to learn multiplicity tree automata in Angluin’s...
متن کاملMMDT: Multi-Objective Memetic Rule Learning from Decision Tree
In this article, a Multi-Objective Memetic Algorithm (MA) for rule learning is proposed. Prediction accuracy and interpretation are two measures that conflict with each other. In this approach, we consider accuracy and interpretation of rules sets. Additionally, individual classifiers face other problems such as huge sizes, high dimensionality and imbalance classes’ distribution data sets. This...
متن کامل